The Computer as von Neumann Planned It 


M. D. Godfrey 
D. F. Hendry 


Address for correspondence: 

Michael D. Godfrey 
michaeldgodfrey@gmail.com 


Updated version of publication: IEEE Annals of the History of Computing, vol. 15, 
no. 1, 1993, pp. 11-21. 

Date: 20 August 2019 

Correction history: 

26 February 2005: A clarification has been provided in Section 4.2 in order to explain 
the “inhibitor” symbol used in the inverter. The reference above to the published copy 
of this paper was corrected(l). And, since after this paper was published the edited 
“First Draft Report on the EDVAC” was published, a reference to this publication is 
provided in [1] von Neumann. 

15 June 2006: The date of publication of the Weik Report [9] was corrected from 
1951 to 1955. 

1 December 2010: While correcting a few typographical errors I took the opportunity 
to rearrange the Tables, Figures and text to be more readable and more like the 
layout of the original publication. 

20 August 2019: Edits, particularly concerning Babbage’s paper [14] Babbage on 
formal logic for the expression and verification of computer design, and discussion of 
the the NPL Pilot Ace. 



The Computer as von Neumann Planned It 


M. D. Godfrey 
D. F. Hendry 


ABSTRACT 


We describe the computer which was defined in von Neumann’s unpublished 
paper First Draft of a Report on the ED VAC, Moore School of Electrical 
Engineering, University of Pennsylvania, June 30, 1945 [1] von Neumann. 
Motivation for the architecture and design is discussed, and the machine is 
contrasted with the EDVAC which was actually constructed. 


Keywords: architecture, computer, EDVAC, stack, tagged-memory, von Neu¬ 
mann architecture. 
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1. Introduction 


John von Neumann made a key contribution to the understanding and development 
of computer architecture and design in his unpublished report titled First Draft Re¬ 
port on the EDVAC [1]. (the acronym is defined in [10] Eckert & Mauchley] to be: 
Electronic Discrete VAriable Computer) However, in reading publications which re¬ 
fer to this report and to the EDVAC computer which it described some perplexing 
observations emerge: 

1. The constructed EDVAC is usually described as being based on the von Neumann 
Report [1] von Neumann. 

2. The von Neumann Report is often described as the collective work of the Moore 
School group, unfairly given the sole authorship of von Neumann (see, for exam¬ 
ple, page xv of [11] Aspray. This would suggest that many of the ideas in the 
Report were shared by the Moore School design group and therefore would be 
expected to appear in the constructed machine. 

3. The EDVAC has been described on numerous occasions, but these references 
do not agree about basic facts: For example, a key feature of any computer is 
the size of each word in the addressable memory. On this subject, [2] Goldstine 
indicates 40 bits, others ([3] Burks) state 32 bits. The only known publication 
giving the correct value (44 bits) is [6] Knuth. The BRL Report [9] Weik (which 
is well-known but was never published) also has the correct value. 

Some of the evident confusion stems from the failure to distinguish between the “ED¬ 
VAC” as described by von Neumann in the Report, and the “EDVAC” as constructed 
at the Moore School. While copies of the von Neumann Report were informally cir¬ 
culated at the time it was written, the Moore School design documents by Eckert 
and Mauchly were kept private and were in fact classified and marked “CONFIDEN¬ 
TIAL.” The main document ([10] Eckert & Mauchly) was changed to “unclassified” 
in 1947.) The confusion has been aggravated by the fact that von Neumann’s Report 
has been reprinted until recently in incomplete or inaccurate forms. An accurate 
transcription is: ([1] von Neumann). 

The main purpose of this paper is to present the architecture given in the von 
Neumann Report ([1] von Neumann) in a form which is accessible to a wider audience, 
and to translate into modern terminology the formal machine definition given in the 
Report. We also compare this definition to the definition of the constructed EDVAC 
system. In doing this we hope to clarify important, but previously unrecognized, 
features of the von Neumann design and to clarify a number of the confusions that 
have arisen over the years. The most substantial description of the Moore School 
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ED VAC is given in ([10] Eckert & Mauchly. The Sections of ([10]) which specify 
the Moore School EDVAC in detail were written by Harry Huskey, who has also 
been most helpful in several discussions about the work at the Moore School. The 
accompanying paper ([13] Williams) in this issue reviews the actual performance of 
the single constructed EDVAC that was delivered to BRL. 

However, this paper in no way replaces the original von Neumann Report. Our 
purpose is only to make clear the definition of the EDVAC machines, and to clarify the 
origins of these definitions. The von Neumann Report contains a wealth of insight and 
analysis which is still not available elsewhere. Few people have had the opportunity 
to read and decipher the original typescript. One person who has understood the von 
Neumann design, particularly from the programming standpoint, is Donald Knuth. 
His paper ([6] Knuth describes the main features and instruction code of the von 
Neumann design, and also discusses improvements that von Neumann developed after 
he had drafted the Report. It was von Neumann’s intent that these improvements 
(most prominently a 32-word register file to replace the 3-register stack) should be 
incorporated into the Report. This was never done. The improvements were based 
on results from a sort program which von Neumann wrote to test the effectiveness of 
his design. A prominent feature of the Report is von Neumann’s recognition that his 
computer would not perform relatively efficiently on sorting problems. This remains, 
substantially, an unsolved problem to this day. 

Unfortunately, reading the Report is made difficult due to the incomplete draft 
form of the original, and the propagation of accumulating errors in the versions that 
have appeared in print [4 von Neumann, 7 Stern, 11 Aspray]. These reprints have 
carried over the original errors, and introduced new errors. Since one reprinting [11 
Aspray] was based on a previous reprint [7 Stern], rather than on the original, further 
compounding of typographical errors has occurred. The incomplete copy in [4] is 
not very useful as it only includes the first 5 introductory Chapters. The inaccurate 
copies in [7 Stern] and [11 Aspray] make von Neumann’s original intent quite hard 
to discover mainly because of numerous mistakes in the mathematical notation. A 
typical paragraph in [7 Stern] (second paragraph, pg. 239) reads: 

Thus each DLA organ has now a number p — 0,1,..., 255 (or 8-digit 
binary), and each minor cycle in it has a number p = 0,1,...,31 
(or 5-digit binary). A minor cycle is completely defined within M by 
specifying both numbers i, p. Due to these relationships we propose 
to call a DLA organ a major cycle. 

This should have read: 

Thus each DLA organ has now a number p — 0,1,..., 255 (or 8-digit 
binary), and each minor cycle in it has a number p = 0,1,..., 31 
(or 5-digit binary). A minor cycle is completely defined within M by 
specifying both numbers p, p. Due to these relationships we propose 
to call a DLA organ a major cycle. 

A few pages later (page 242) the notation switches from p to u, but then later (page 
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243) p is switched to p since the typist stopped typing p and wrote in p by hand. 
The fact that the first reprint [4] contains only the first 5 of 15 Chapters has led 
to additional confusion. For example, the March 1992 issue of Computing Reviews 
contains a review of William Aspray’s John von Neumann and the Origins of Modern 
Computing. The reviewer states: “(Perhaps the lack of publication accounts for 
discrepancies between the author’s quotes and the version of the Report appearing 
on pages 355-364 of a book edited by Brian Randcll... ” The reviewer is obviously 
unaware of the fact that Aspray was referring to the full 15-chapter Report, not the 5 
chapters reprinted in Randcll [4 von Neumann] . The reviewer goes on to draw further 
conclusions about von Neumann’s role in computer development based on the belief 
that the Report contained only the 5 introductory chapters, when in fact all of the 
substance of the EDVAC design and architecture as expressed by von Neumann is 
contained in Chapters 6 through 15. 

The original manuscript from the Moore School is easier to read than the pub¬ 
lished versions since it has fewer errors and it is easier to identify obvious typographi¬ 
cal mistakes, ffowever, as a first draft, it contains a great many typographical errors, 
particularly in the mathematical and special symbols. I have prepared a corrected 
version which reconstructs what was surely von Neumann’s intended text. This ver¬ 
sion has not been published, but it is hoped that this will be possible in the future. 
In doing this work I converted the manuscript to T^X form so that it could be easily 
managed and made ready for publication. This also had the effect that this version 
is easier to read because of the improved typography. (Note added 26 February 2005: 
This version has been published. See [[1] von Neumann].) Needless to say, the Report 
is a brilliant piece of work. All contemporary computer projects made use of material 
from the Moore School, typically including a copy of the Report. Alan Turing [12] 
explicitly based his computer design on the Report. However, curiously, the com¬ 
puter built under von Neumann’s guidance at the Institute for Advanced Study did 
not follow the architecture or design principles of the von Neumann EDVAC. This 
fact deserves further study. (It is of course well-known that von Neumann’s focus 
of interest had by then moved to other subjects, including new work on computer 
theory.) 

We hope that both the availability of a corrected text and this introductory guide 
will make this key contribution more accessible. 
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2. The Two EDVAC’s 


That the EDVAC described by von Neumann (to be referred to as vN-EDVAC) and 
the EDVAC constructed at the Moore School (to be referred to as M-EDVAC) are very 
different in architecture and design will become clear below. It would be interesting 
to know how these differences arose, especially since the IAS machine [8] was closer 
to M-EDVAC than it was to vN-EDVAC. Von Neumann did not, after an initial 
period, get along very well with some members of the Moore School group due to 
both technical and other disagreements. It would appear that he wrote the Report 
as an effort to state the architecture and design as he imagined it at that time. The 
Report was apparently written while von Neumann was at Los Alamos and delivered 
to the Moore School in handwritten form. (Goldstine is reported to have said that he 
had a copy of the handwritten draft, but no such copy has been found in his archives 
at Hampshire College.) It was typed at the Moore School, but there is no evidence 
that von Neumann proof-read the result. The Report in any case had little ultimate 
impact on Eckert and Mauchly and the rest of the Moore School design team who 
designed M-EDVAC as they wanted it. (This account is based on conversations with 
Harry Huskey.) 

M-EDVAC was a serial, synchronous, 44-bit word, 4-address (three operand ad¬ 
dresses and the next instruction address), binary machine with 12 operation codes. 
It had 4 registers, but these do not appear to have been addressable. It used parallel 
comparison of the two arithmetic units for error checking. (See Williams [13] for de¬ 
tails about the realized reliability and performance of the machine.) These and other 
features of the machine are summarized in Table 1 which is an excerpt from a BRL 
report by Weik[9]. 

vN-EDVAC was a serial, synchronous, 32-bit word, zero-address, binary machine 
with a hierarchical operation-code structure of 8 basic codes, 10 sub-codes, and one 
modifier. It had 3 non-addressable registers, organized as a stack mechanism. Tagged 
memory was used to distinguish instructions from data. This feature is more fully 
explained in Section 3.4. 

The main features of the two designs are compared in Table 2. 
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Table 1: M-EDVAC 


EDVAC 

Specifications from Martin Weik, BRL 
Report No. 971. 

Manufact urer 

Moore School of Electrical Engineering 
University of Pennsylvania 


2880 microsec (min 2208 max 3552) 

Divide time (including storage access): 2930 
microsec (min 2256 max 3600) 

Construction: Vacuum tube and Diode gates 
Number of rapid access word registers: 4 
Basic pulse repetition rate: 1.0 megacycle/sec 
Arithmetic mode: Serial 

Storage 


Operating Agency 

U.S. Army Ordnance Corps Ballistic 
Research Lab, APG 

General System 

Applications: solution of ballistic equations, 
bombing and firing tables, fire control, data 
reductions, related scientific problems. 

Timing: Synchronous 

Operation: Sequential 

A general purpose computer which may be 
used for solving many varieties of 
mathematical problems. 


Media 

Words 

Microsec 

Access 

Mercury Acoustic 

Delay Line 

1024 

48-384 

Magnetic Drum 

4608 

17,000 


Includes relay hunting and closure. The 
information transfer to and from the drum is 
at one megacycle per second. The block 
length is optional from 1 to 384 words per 
transfer instruction. 

Input 

Media Speed 


Numerical System 

Internal number system: Binary 
Binary digits per word: 44 
Binary digits per instruction: 4 
bits/command, 10 bits each address 
Instruction per word: 1 
Total no. of instructions decoded: 16 
Total no. of instructions used: 12 
Arithmetic system: Fixed-point 
Instruction type: Four-address code 
Number range: — (—2 -43 ) < x < 

(1 - 2 -43 ) 

Arithmetic Unit 

Add time (including storage access): 864 
microsec (min 192 max 1536) 

Multiply time (including storage access): 


Photoelectric Tape 


Reader 

942 

sexadec 

chars/sec 


78 

words/sec 

Card Reader(IBM) 

15 

rows/sec 


100 

cards/min 

Output 

Media 

Speed 

Paper Tape Perf. 

6 

sexadecimal 

chars/sec 


30 

words/min 

Teletypewriter 

6 

sexadecimal 

chars/sec 


30 

words/min 

Card Punch (IBM) 

100 

cards/min 


800 

words/min 
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Table 1-cont: M-EDVAC 


Number of Circuit Elements 


Tubes: 

3563 


Tube Types: 

19 


Crystal diodes: 

8000 


Magnetic elements: 

1325 

(relays, coils 
and trans) 

Capacitors: 

5500 

approx. 

Resistors: 

12000 

approx. 

Neons: 

320 

approx. 


Checking features 

Fixed comparison— Two arithmetic units 
perform computation simultaneously. 
Discrepancies halt machine. 

Paper tape reader error detection. 

Physical Factors 


Power Consumption: Computer, 50 kW 

Space occupied: Computer, 490 sq. ft. 

Total weight: Computer, 17,300 lbs. 

Power consumption: Air Cond., 25 kW 

Space occupied: Air Cond. 6 sq. ft. 

Total weight: Air Cond. 4345 lbs. 

Capacity: Air Cond. 20 tons 


Manufacturing Record 

Number produced: 1 
Number in current operation: 1 

Cost 

Rental rates for additional equipment: 

IBM card reader $82.50 

IBM card punch $77.00 

Approximate cost of basic system: $467,000 


Personnel Requirements 

Daily Operation: 3 8-hour shifts. No. of 
Tech: 8. 

7 days/week 

No engineers are assigned to operation of the 
computer, but are used for design and 
development of improvements for the 
computer. The technicians consult with 
engineers when a total breakdown occurs. 

Reliability and Operating Experience 

Average error free running period: 8 hours 

Operation ratio: 0.79. Good time: 130.5 hrs. 

(Figures for ’55) Attempted to run: 166 
hrs./wk. 

No. of different kinds of plug-in units: 3 

No. of separate cabinets (excluding power and 
air cond.): 12 

Operating ratio figures for 1954: 

Operating ratio: 0.79. Good time: 129 hrs. 
Attempted to run: 163 hrs./wk. 

Additional Features and Remarks 

Oscilloscope and neon indicator for viewing 
contents of any storage location at any time. 

Exceed capacity options: halt, ignore, transfer 
control, or go to selected location. 

Unused instruction (command) halt. 

Storage of previously executed instruction and 
which storage location it came from, for 
viewing during code checking. 

Storage of current instruction and storage 
location from which it originated. 

Address halt when prescribed address appears 
in any of 4 addresses of instruction to be 
executed by computer. 

Tape reader error detection. 
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Table 2: vN-EDVAC vs. M-EDVAC 

Feature: 

vN-EDVAC 

M-EDVAC 

Timing: 

Basic Design 

Synchronous 

Synchronous 

Operation: 

Sequential 

Sequential 

Internal number system: 

Numerical System 

Binary 

Binary 

Binary digits per word: 

32 

44 

Data bits per word: 

31 

32 

Memory tag bits: 

1 

0 

Bits / command: 

3+5 

4 

Binary digits per address: 

13 

10 

Instructions per word: 

1 

1 

No. of instructions decoded: 

8+16 

16 

No. of instructions used: 

8+10 

12 

Arithmetic system: 

Fixed-point 

Fixed-point 

Instruction type: 

Zero-address code 

Four-address code 

No. of registers: 

3 (non-addressable) 

4 (non-addressable (?)) 

Number range: 

_(_2- 30 ) < x < (1 - 2" 30 ) 

— (—2 —43 ) < x < (1 - 2 -43 ) 

Media 

Storage 

Words 

Microsec 

Words Access 

Mercury Acoustic 

Delay Line 

8192 

1024 48-384 

Magnetic Drum 


4608 17,000 


Number of Circuit Elements 

Tubes: 

2000-3000 (est) 

3563 

Tube Types: 


19 

Crystal diodes: 


8000 

Magnetic elements: 


1325 (relays, coils 

Capacitors: 


and trans) 

5500 approx. 

Resistors: 


12000 approx. 

Neons: 


320 approx. 
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3. vN-EDVAC Architecture 


Throughout the Report von Neumann mentions the need to develop the structure of 
the system giving consideration to both design and architecture issues. The inter¬ 
action of time and space and the need for locality in time and space are repeatedly 
discussed. These issues arise particularly in the determination of the size and perfor¬ 
mance of the delay line memory and in choices of primitive operations. 

One may subdivide architecture into standard categories: addressing, instruction 
definition, protection, interrupt control, and input-output. Only one aspect of the 
last three categories is defined in the Report. Instruction memory (words tagged 
as containing instructions) was protected against modification of any fields except 
the address field. It is not explained how memory could be initially loaded with 
instructions. However, this was presumably a part of the I/O system. 

3.1 Addressing Structure 


All address values are included in the load, store and control transfer instruction 
fields or are based on the value of the instruction address register (PC). Address 
modification is carried out by computation of the desired address and then storing 
the address into the address field of the appropriate instruction in memory. All 
addresses are given as a variable pair [/x, p\ but this is purely for design reasons. All 
addresses are 13-bit, word addresses. 

3.2 Number Representation and Arithmetic Operation 


All numeric data (termed standard numbers) are 31-bit signed binary integers. 
The rightmost (first) bit in the 32-bit word is the tag bit, with zero meaning that 
the word contains a standard number. (Memory locations are taken to be increasing 
to the left.) Data are stored least significant bit first, sign bit following (to the left 
of) the most significant bit and with the binary point taken to be between the most 
significant and the sign bit. Negative numbers are in two’s-complement form. Thus, 
the range of standard numbers is — 1 < n < 1 with a precision of approximately 8 
decimal digits. At this point, and throughout the Report, despite a couple of switches 
of notation, von Neumann is clearly a “little-endian” (See [5]). All data are arranged 
so that the least significant bit is “first.” 
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Standard twos complement arithmetic is provided for addition, subtraction, multi¬ 
plication, division, and square root. Rounding is provided by computing an additional 
check bit and “rounding to the nearest odd digit.” This was done to avoid carry due 
to rounding. No provision is made for detecting out-of-range results for addition, 
subtraction or division. 

3.3 The Central Arithmetic (CA) Unit 


The CA contains three registers: I ca , J ca , and O ca ■ Ica is the input register and 
may be viewed as the top-of-stack register. J ca is the second word of the stack, but 
may also be the source register for transfers within CA. O ca receives the output of 
operations which use I ca and J ca as inputs. It always acts as an accumulator, i.e. all 
results are formed by: 

result + O ca -v O ca . 

However, the store operations optionally allow clearing of O ca after the store opera¬ 
tion. All store operations store O ca . The interconnection of these registers is shown 
in Figure 1. 



Figure 1: CA Diagram 


This Figure is a more complete version of Figure 17 in the Report. The only way in 
which data enter the CA is by being loaded into I ca . This always causes the previous 
contents of I ca to be pushed into J ca . The only path for data out of the CA is from 
O ca ■ Operation of the binary operators, then, involves a sequence of the form: 


LOAD 

M[j] 

Ica ^ Jca-> M [j ] ^ Ica 

LOAD 

M[k] 

Ica ^ Jcai A7 [/c] y I ca 

OP 



STORE 

M[r] 

O ca —> M[r], optionally clear O ca 
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In operations such as the above, I ca and O ca are implicitly addressed, as in stack- 
based or zero-address systems. However, instructions are also available to cause the 
transfers: 


Ica t 0 C a, 

Jca t O ca -, 
Oca t Ica, 


as indicated in Figure 1. Thus, for example, a program segment to compute 

4 

s = Xiyi 

i =1 

for literal data Xi and m could be: 


0 

Load zero. 

X 

Clear accumulator. 

X\ 

Implicit load immediate x\ 

m 

Implicit load immediate yi 

X 

Multiply and accumulate. 

X2 


V2 

Repeat 

X 

for 

Xs 


V3 

remaining data. 

X 


X4 


2/4 


X 


— > M[addr(S )] 

Store result at address of S 


A more fully parameterized procedure for computing inner products could be con¬ 
structed using data address computations and loop control constructs. The lack of 
any address indexing mechanism or index registers causes array referencing to require 
additional instructions, as is true of many “modern” RISC (Reduced Instruction Set 
Computer) designs. 

3.4 Instruction Definition 

The definition and operation of the Central Control (CC) and Central Arithmetic 
(CA) sections of the vN-EDVAC are described in Chapters 11 and 13 through 15 of 
the Report. A full understanding of these Chapters requires some effort. Table 3 
provides a glossary of the main components of the processor: 
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Table 3: Glossary 


CA 

CC 

PC 

SG 

A 

E-element 

M 

R 

I 

O 

L 

s 

gr 

Mi 

= W = 

minor cycle 
major cycle 


Central Arithmetic-Logic Unit 
Central Control Unit 

Program Counter (address of current instruction) 
Switching and Gating Unit 
feedback amplifier 


dl 


gate 

Memory 

External storage 
Input channel 
Output channel 

Memory read (L 0 ) and write (Li) lines 
dl select line 


k unit delay 


1-bit memory 
k-bit (serial) memory 
32-bit word 
32 words of memory 


The CC section is based on a conventional instruction sequencing mechanism for 
normal instruction processing. Given that the address in PC points to the current 
instruction in memory, the instruction at that address is fetched, decoded, executed, 
PC is incremented, and the operation cycle is repeated. (Note that M-EDVAC loaded 
the next PC value from a held in the current instruction word as is common in many 
microcode systems.) There are two exceptions to this standard instruction processing 
loop. First, is a control transfer instruction which loads PC with a new address, von 
Neumann discussed the possibility of an execute-remote operation as a “transient 
transfer,” but decided against implementation. Second, if when an instruction word 
is fetched, it is found that the instruction tag bit is clear an implicit load-immediate 
instruction is executed. Thus, the contents of the word addressed by PC are loaded 
into I ca and the PC is incremented in the normal way. 

In the Report, the operations within CA and the load and store operations are 
described first. These (termed the unpooled orders) are not the actual machine in¬ 
structions, but are distinct functional components of the instructions. Tables 4 and 
5 summarize the notation and meaning for the unpooled orders. The actual instruc¬ 
tions are termed the pooled orders and are summarized in Table 6. As can be seen, 
there are 8 instruction types ( t'). The CA instructions use the sub-code field w, and 
the CA-store instructions use w and the modifier c. The main reason given for using 
the pooled orders as the actual machine instructions was the improved bit utilization 
in the instruction fields. As can be seen by comparing the unpooled orders in Table 
5 with the pooled orders in Table 6, the only lost functions are 5, e, and 9 taken as 
separate operations. Since those operations would normally follow an a operation, 
the pooling seems natural. While the bit utilization of instruction words is still quite 
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low (the minimum number of unused bits is 10), von Neumann remarks that code 
space is likely to be small as compared to data space, and room should be left for 
expansion of the address held. Such reasoning and foresight would have been helpful 
to recent and current microprocessor designs. Table 6 is arranged using the layout 
that might have been used by von Neumann, since he generally referred to the fields 
in the instructions using a layout of least significant bit at the right. (However, for 
numerical data, the 31 bits ip.. 131 , were usually referred to in left to right order even 
though they were stored “least significant bit first.”) The instruction formats could 
equally have been drawn with the bit order reversed so that they would read more 
naturally from left to right. However, this arrangement emphasizes the bit-serial, 
“little-end” first structure of the machine. 


Table 4: Notation for Instruction Fields 

io 

- tag bit 


0 - data 


1 - instruction 

t 

- unpooled instruction code 

t' 

- pooled instruction code 

w 

- operation sub-code for CA (stack) operations 

c 

- modifier for store operations 


0 - clear O ca 


1 - retain value in O ca 

p 

- major cycle (delay line) address 

p 

- minor cycle (word within delay line) address 


The two means of carrying out load-immediate operations deserve a comment. 
First, if a data word (i 0 = 0) is encountered during instruction processing an implied 
load of the contents of the word is carried out. In addition, the 7 instruction (the 
second instruction in Table 6 ) loads the word which immediately follows it. It was 
not mentioned that 7 should also have the side-effect of incrementing the PC an 
extra time so that the following word is not subsequently executed as an instruction. 
However, 7 would appear to be almost entirely redundant since, if the following word 
is data (io = 0 ) then just executing the following word as an implied load-immediate 
would have exactly the same effect as 7. It does not appear that von Neumann 
considered the possibility of following 7 with an instruction word (*o = 1 ) so that 
instructions could be loaded by this means. 

Unconditional control transfer is provided by the ( instruction. Conditional trans¬ 
fers use the s operation (sign test) to select which address value will be moved to O ca . 
This address must then be stored into the address held of an immediately following ( 
instruction if an immediate transfer is to be made. However, the store could be made 
into a subsequent location in order to achieve some of the effects (and side-effects) of 
the delayed branch (RISC) operation. 
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Table 5: Operation code Definitions 


1. Unpooled types (t): 

order name 


a 

P 

7 

5 

e 

e 

C 

v 

2. a operations: 


CC operations (stack) 
load 

load immediate 
store 

store immediate 
CC move (stack) 
load PC (jump) 

I/O 


definition 

See 2. below 
M[/i, p\ —y I ca 
M [PC + 1 ] ->■ I ca 

0 C a M[l l - p] 

O ca ->• M[PC + 1] 
Oca t Ico 
M[p, p ] PC 
not defined 


modifier value 

operation definition 

w = 0 

+ 

(Ica + Jca) + 0 C a Oca 

1 

— 

( Ica Jca) T O 0 a t OcCL 

2 

X 

ij-ca X Jca) T Oca t Oc(X 

3 

/ 

{Jca/ Jca) T 0ca t OcCL 

4 


VIca + Oca Oca 

5 

i 

I ca -» Oca 

6 

j 

J ca Oca 

7 

s 

perform i or j depending on sign of Oca 

8 

db 

decimal —y binary 

9 

bd 

binary —y decimal 


3. Memory addressing: 


p[p 7 ...p Q ] 
p[p4--.po\ 
[p p\ 


major memory cycle (segment) 
minor memory cycle (word) 

13-bit memory address (word-address) 


Note: For store operations, if fij = 1 at the target address M[p,p] then only the 
[/i, p\ field is replaced by the high-order 13 bits of the operand. 


Table 6 shows the organization of the pooled orders. Table 7 is taken directly 
from the von Neumann Report [4], It shows the manner in which von Neumann 
summarized the logical definition of the machine. 
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Table 6: Pooled Orders 


type ( t') instruction format 

(least significant bit at right, 
bit width of each held indicated 
below each instruction) 


io = 0 

7 

ol T <5 

a + e 

ol T 6 

a 

P 

c 

v 


*o 



0 







1 






t' 

*0 


0 

1 






3 

1 

p 

P 


w 

c 

a 

*0 






i 

0 

8 

5 

10 

4 

1 

3 

1 




w 

c 

t' 

*0 




2 

1 



23 

4 

1 

3 

1 




w 

c 

? 

*0 




3 

1 



23 

4 

1 

3 

1 




w 

c 

? 

*0 




4 

1 



23 

4 

1 

3 

1 

P 

P 




t’ 

*0 




5 

1 

8 

5 



15 

3 

1 

P 

P 




a 

*0 




6 

1 

8 

5 



15 

3 

1 


not defined 


t' 

*0 


7 

1 


3 1 


definition meaning 


M[PC\ —» I m load immediate 


M[PC + !]—)■ I ca load immediate 


°p- 

O ca —■* Af[/i, p] 

stack operation; 
store 

OP ; stack operation; 

O ca —$■ M[PC + 1] store immediate 

OP; 

Oca ^ Ica 

stack operation; 
load 

OP 

stack operation 

M[p. p\ —> I ca 

load 

M[p , p\ -> PC 

control transfer 

I/O 

input-output 


Note: Von Neumann did not assign numeric codes to the 8 pooled orders. (At the 
end of the Report he indicated that he would do that next.) I have filled in 
numeric codes in the t' held in this Table just to make it more definite. Also, 
c is the O ca clear hag as defined in Table 4, and w is the CA order modifier 
as defined in Table 5. The CA processing how is shown in Figure 1 which is 
adapted from Figure 17 in the Report. 
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Table 7: Instruction Definition 


(I) 

Type 


(II) 

Meaning 


(HI) 

Short Symbol 


(IV) 

Code Symbol 


Minor cycle 

I = (i v ) = 

(* 0 * 1*2 • • • * 31 ) 


Standard 
Number 
or Order 

(7) 


Storage for the number defined by £ = *31*30 • • • *1 = 
X 0 uLiD 2 1,_31 (mod 2 ) —1 < £ < 1 . *31 is the sign: 0 
for +, 1 for —. If CC is connected to this minor cycle, 
then it operates as an order, causing the transfer of 
£ into I ca . This does not apply however if this minor 
cycle follows immediately upon an order w —> A or 
wh —> A. 




*o = 0 


Order 

(a) + (<5) 


Order 
(a) + (e) 


Order 

(a) + (0) 


Order to carry out the operation w in CA and to 
dispose of the result, w is from the list of 11.4. These 
are the operations of 11.4, with their current numbers 
w.decimal and w.binary, and their symbols w: 


w.decimal 

w.binary 

W 

w.decimal 

w.binary 

W 

0 

0000 

+ 

5 

0101 

i 

1 

0001 

- 

6 

0110 

j 

2 

0010 

X 

7 

0111 

s 

3 

0011 

= 

8 

1000 

db 

4 

0100 


9 

1001 

bd 


w —► /.ip 
or 

wh —> /tp 


w —> f 
or 

wh —> f 


w —► A 
or 

wh —> A 


*o = l 


Order 

(a) 


h means that the result is to be held in O ca . —> pp 
means that the result is to be transferred into the 
minor cycle p in the major cycle p; —> f, that it is 
to be transferred into the minor cycle immediately 
following upon the order e; —> A, that it is to be 
transferred into I ca \ no —that no disposal is wanted 
(apart from h). 


Order 


Order to transfer the number in the minor cycle p in 
the major cycle p into I ca . 


A pp 


Order 

(0 


Order to connect CC with the minor cycle p in the 
major cycle p. 


C -S— pp 
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4. vN-EDVAC Design 

Von Neumann developed both the architecture and the design of the vN-EDVAC 
based on detailed analysis of the performance and resource requirements of a number 
of computational problems. Normal instruction sequencing was intended to permit 
instruction execution at the rate at which data arrived from the output of a delay 
line. The length of the delay line was determined based on the assumption that the 
average delay for a memory reference after an arithmetic operation would be short 
as compared to the arithmetic time. The ability to retain intermediate results in the 
CA registers reduces the frequency of store and load operations which would, unless 
addresses were carefully chosen, take an average of half a major (delay line) cycle 
time. 

4.1 Memory Design 

The intended memory was to be made up of mercury delay lines. Each delay line 
contained 32 32-bit words. At a clock rate of IMhz the circulation time of the delay 
line was about 1ms. The size of the delay lines was determined from the fact that 1ms 
was approximately the arithmetic time of the CA unit. Thus, new operands would 
become available from the current delay line at about the time they would be needed 
by the CA. The spirit of this analysis was sound, but it neglected important factors 
including instruction fetch requirements. 

The size of the memory was determined after consideration of several possible 
numerical problems. In the course of the analysis of memory size and logic complexity, 
von Neumann remarks: “the decisive part of the device, determining more than any 
other part its feasibility, dimensions and cost, is the memory.” Based on this analysis, 
von Neumann settled on a total memory size of 8k words or 256 delay lines. 

4.2 E-elements and Logic 

This is, as far as I am aware, the first substantial work (since [14] Babbage) 1 which 
clearly separated logic design from implementation, and gave a formal scheme for 
logic representation and verification. 

However, it is a curious fact that the notation which was fully established and 
extensively used in this report was totally absent from subsequent works, for example 
Burks[8] and Eckert [10]. The simplest gate was drawn as 

1 I knew from the general literature such as [15] Hyman about Babbage’s difference 
and analytical engines, but I only recently obtained a copy of his fundamental paper 
“On a method of expressing by signs the action of machinery,” [14] Babbage 
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—o^- 

while an inverter was drawn as 

—o^- 

Note that the — • symbol is an “absolute” inhibitor. See Section 6.4 of [1] von 
Neumann. 


A two input OR gate would be 
The notation 

was used for a two input AND gate. A three input AND gate was given by 



Thus, the number inside the circle indicated the minimum number of active inputs 
required to drive the output active. It is noted that a two input AND gate would be 
defined by 



in terms of elementary gates. The the number of arrows on the output line indicated 
the number of unit delays (r) introduced by the element. The arrow notation also 
served to indicate the output line. Thus 

—CH—>- 

was used to indicate a construct with total delay of 2r, where r is the basic gate 
delay time. 

The notion of composition was clearly established, so that, for instance, the adder 
circuit 




h 


was subsequently represented by 

















18 


The Computer as von Neumann Planned it 





which was termed a block symbol. 

4.3 Complexity 

Due to the choice of purely serial and synchronous operation, it was expected that 
the logic (CA and CC) would require a few hundred vacuum tubes and the memory 
would require around 2000, for a total count of under 2500. 

5. Conclusion 

This paper has given a review of the nature of the design and of some of the innova¬ 
tions which were present in this first modern computer definition. 

The computer defined in the First Draft Report on the ED VAC was never built 
and its architecture and design seem now to have been forgotten. The Report was 
a fundamental influence on Turing’s work. When he was at NPL he proposed to 
build a computer [12 Turing] and in his introduction he pointed out that in order 
to understand his design the reader should read von Neumann’s First Draft Report. 
However, the Pilot ACE that Turing defined in [12 Turing] was not built to his 
specification. It was built after long delay caused by indecision on the part of the 
National Physical Laboratory, and well after Turing had left. 

It should be mentioned that, due to the continuing secrecy with respect to all 
Bletchley activities, the NPL staff could not have known that Turing had any practical 
experience of computers. However, the resulting Pilot Ace was specified and built by 
Harry Huskey from the Eckert and Mauchly group. When I asked Harry Huskey what 
he thought of the von Neumann Report he answered “I could not make head to tail 
of it.” Thus, even in this time, the von Neumann Report was not as influential as 
could have been expected. 
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